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INTRODUCTION: 

Breast  cancer  is  the  second  most  common  form  of  cancer  in  women,  striking  1  out  of  8  women  in 
their  lifetime.  Ovarian  cancer  strikes  fewer  women  but  is  generally  at  an  advanced  stage  at  the  time  of 
detection.  Both  diseases  are  controlled  by  multiple  genetic  defects,  suggesting  the  involvement  of  many 
different  genes,  including  tumor  suppressors.  According  to  the  two-hit  model  of  Knudson,  both  alleles 
encoding  for  a  tumor  suppressor  must  be  lost  or  inactivated  in  order  for  cancer  to  develop.  Based  on  this 
model,  loss  of  heterozygosity  (LOH)  of  alleles  from  tumor  tissue  has  been  used  to  suggest  the  presence  of 
potential  tumor  suppressor  genes. 

The  short  arm  of  chromosome  17  is  one  of  the  most  frequently  altered  regions  in  human  breast  and 
ovarian  cancer.  One  locus  of  high  allelic  loss  is  at  17pl3.1,  and  contains  the  tumor  suppressor  gene,  TP  53. 
However,  we  and  others  have  shown  a  second  region  of  LOH  distal  to  the  TP  5  3  gene,  at  17pl  3.3,  in  breast 
tumors  and  ovarian  tumors.  Genomic  abnormalities  involving  1 7pl  3.3  has  also  been  reported  in  primitive 
neuroectodermal  tumors,  carcinoma  of  the  cervix  uteri,  medulloblastoma,  osteosarcoma,  astrocytoma  (22), 
and  acute  myeloid  leukemia  and  myleodysplastic  syndromes,  suggesting  that  a  gene(s)  on  17pl3.3  may 
play  a  role  in  the  development  of  a  wide  variety  of  neoplasms,  including  breast  and  ovarian  cancer. 

We  have  previously  defined  a  minimum  region  of  allelic  loss  (MRAL)  on  chromosome  17pl 3.3  in 
genomic  DNA  from  ovarian  tumors  and  breast  tumors.  Positional  cloning  and  sequencing  techniques 
revealed  two  genes  in  the  MRAL,  referred  to  as  OVCA1  and  OVCA2,  which  overlap  one  another  in  the 
MRAL,  and  have  one  exon  in  common  (Figure  1).  Since  translation  of  OVCA1  does  not  proceed  into  the 
shared  exon,  the  genes  encode  for  completely  distinct  proteins.  The  function  of  OVCA1  and  OVCA2  are 
unknown  and  their  potential  role  in  breast  and  ovarian  oncogenesis  has  been  a  major  focus  of  our  studies. 

BODY: 

Progress  Report 

Year  4 

Months  37-48.  Specific  Aim  3;  Complete  cloning  and  sequencing  of  clones  that  interact  with 
OVCA1,  initiate  characterization  of  function  of  these  new  proteins  (if  not  already  known);  Evaluate  effect 
of  OVCA2  expression  alone  or  in  combination  with  OVCA1  on  morphology  and  cell  growth,  initiate 
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mutational  studies  of  the  OVCA1  homologue  in  yeast,  establish  transgenic  mice  with  regulatable  OVCA1 
expression  vectors,  initiate  gene  "knockout"  studies  in  immortalized  HOSE  cells. 

OVCA1  protein  inter  actors 

OVCA1  codes  for  a  highly  conserved  protein  with  no  known  function.  To  help  determine  the 
function(s)  of  OVCA1,  we  used  a  yeast  two-hybrid  screening  approach  to  identify  OVCA1 -associating 
proteins.  One  such  protein,  which  we  initially  referred  to  as  BOV-1  (Binder  of  OVCA1-1)  is  173  or  174 
amino  acid  in  length  and  appears  to  be  a  new  member  of  a  highly  conserved  RNA-binding  motif  (RBM) 
protein  family,  that  is  highly  conserved  evolutionarily.  Northern  blot  analysis  revealed  that  BOV-1  is 
ubiquitously  expressed  and  that  three  distinct  mRNA  species  are  expressed,  a  1  kb,  a  3.2  kb,  and  a  5.8  kb 
transcript.  The  1-kb  transcript  is  the  most  abundant  and  is  expressed  at  high  levels  in  the  testis,  heart, 
placenta,  spleen,  thymus,  and  lymphocytes.  Using  fluorescence  in  situ  hybridization  (FISH)  and  the  5.8-kb 
cDNA  probe,  we  determined  that  BOV-1  maps  to  both  chromosome  5ql  3-14  and  to  chromosome  14q22- 
23.  Further  sequence  analysis  determined  that  the  gene  coding  the  1  kb  and  the  3.2  kb  transcripts  (hereafter 
referred  to  as  RBM8A )  maps  to  14q22-23  whereas  a  second  highly  related  gene  coding  for  the  5.8  kb 
transcript  resides  at  chromosome  5ql3-14  (hereafter  referred  to  as  RBM8B).  The  predicted  proteins 
encoded  by  RBM8A  and  RBM8B  are  identical  except  that  RBM8B  is  15  amino  acids  shorter  at  its  N- 
terminus.  Molecular  modeling  of  the  RNA  binding  domain  of  RBM8A  and  RBM8B,  based  on  homology 
to  the  sex-lethal  protein  of  Drosophila,  identifies  conserved  residues  in  the  RBM8  protein  family  that  are 
likely  to  contact  RNA  in  a  protein-RNA  complex.  The  conservation  of  sequence  and  structure  through  such 
an  evolutionary  divergent  group  of  organisms  suggests  an  important  function  for  the  RBM8  family  of 
proteins.  Further  details  of  these  studies  can  be  found  in  the  appended  publication  (Salicioni  et  al.,  2000). 

OVCA1  isoforms 

We  have  recently  found  that  OVCA1  exists  in  at  least  two  forms:  a  48  kDa  and  a  50  kDa  protein 
(Figures  2  and  3).  Subcellular  fractionation  studies  indicate  that  p50OVCA1  localizes  primarily  to  the 
cytoplasm  whereas  p48OVCA1  is  predominantly  found  in  the  organelle  fraction  with  some  in  the  nuclear 
fraction  (Figure  4).  Immunohistochemical  analysis  of  normal  ovaries  and  ovarian  tumors  using  p50OVCA1 
or  p48OVCA)  specific  antibodies  demonstrated  strong  nuclear  and  cytoplasmic  staining  in  the  epithelial  cells 
of  the  normal  ovaries  and  borderline  tumors  for  both  antibodies.  In  contrast,  p50OVCAI  was  observed 
primarily  in  the  nucleus,  with  little  or  no  cytoplasmic  staining  in  the  vast  majority  of  malignant  ovarian 
tumor  cells.  Interestingly,  no  nuclear  or  cytoplasmic  staining  was  detected  in  these  adenocarcinomas  using 
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antibodies  exclusive  for  p48OVCA1.  Furthermore,  there  was  not  apparent  correlation  with  OVCA1 
expression  and  cellular  proliferation  (as  determined  by  ki67  staining)  and  BRCA1  levels  in  these  tumors.  2- 
D  gel  western  blotting  revealed  that  OVCA1  is  extensively  modified  post-translationally  in  tumor  cells 
leading  to  a  dramatic  shift  in  pi  from  ~9  to  ~5.5  (Figure  5).  These  aberrant  modifications  result  in  the 
p50OVCA1  form  and  loss  of  immunoreactive  p48OVCA1.  Studies  are  underway  to  evaluate  a  large  panel  of 
breast  and  ovarian  tumors  for  postranslational  modifications  of  OVCA1  and  to  determine  the  means  by 
which  OVCA1  is  modified.  Overall,  our  studies  indicate  that  altered  expression  and/or  post-translational 
modifications  of  OVCA1  may  be  associated  with  the  development  of  malignant  tumors  of  the  ovary  and 
breast  and  suggest  a  potentially  new  mechanism  for  the  inactivation  of  tumor  suppressors  in  cancer. 

As  an  initial  step  in  determining  the  residue(s)  that  is  potentially  modified,  we  have  randomly  mutated 
the  OVCA1  cDNA  by  passaging  the  cDNA  expression  vector  through  mutator  bacteria  (XLI-RED; 
Stratagene).  As  our  initial  screen  we  searched  for  a  cDNA  encoding  a  mutant  protein  that  would  be 
detected  using  both  N-terminal  antibody  (TJ132)  and  C-terminal  antibodies  (FC22).  We  have  found  that 
p50OVCA1  is  detected  using  TJ132  while  the  p48OVCA1  is  detected  with  FC22  (as  indicated  above).  When 
we  modified  the  protein  by  adding  a  HA-tag  we  are  able  to  detect  the  p50OVCAI  with  both  antibodies.  We 
have  very  recently  completed  our  initial  screen  of  —330  clones  for  reactivity  to  both  antibodies.  We 
identified  on  clone  that  encoded  for  a  protein  of  potential  significance.  Studies  are  underway  to  determine 
what  residue  is  mutated  and  how  this  mutation  leads  to  reactivity  with  the  FC22  antibody.  Furthermore, 
we  are  determining  whether  overexpression  of  the  mutant  is  capable  of  suppressing  tumor  cell  growth  as 
compared  to  wild-type  OVCA1. 

Generation  of  Ovca-/+  and  Ovca-/-  mice  (update  on  progress  and  change  in  direction) 

As  indicated  in  the  previous  progress  report,  we  had  recently  cloned  the  mouse  Ovca  gene  locus.  We 
determined  the  intron/exon  boundaries  for  the  entire  murine  Ovcal  gene  and  determined  that  the  deletion  of 
exon  10  through  intron  12  would  produce  non-functional  Ovcal  and  Ovca2  proteins.  Therefore,  we  chose 
to  selectively  remove  exons  10-12  of  Ovcal  and  exon  1  of  Ovca2  to  give  us  the  best  chance  of  observing  a 
phenotype  in  the  mouse,  since  other  open  reading  frames  exist  in  this  region  (see  previous  progress  report). 
Furthermore,  if  future  studies  warrant,  each  gene  could  be  selectively  “knockout”  to  determine  if  one  or  all 
contribute  to  tumor  development. 
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We  have  designed  and  assembled  our  targeting  construct,  which  should  result  in  the  recombination 
shown  below  (Figure  6).  As  can  be  seen  in  the  diagram,  we  have  fine  mapped  the  region  and  have  already 
derived  probes  that  can  be  used  to  tell  us  if  there  is  successful  integration.  A  Dralll  digest  of  DNA 
combined  with  Probe  A  should  yield  a  ~6.5  kb  fragment  if  there  is  correct  insertion  versus  the  normal  ~6.0 
kb  germline  fragment.  Note  that  Probe  A  is  derived  from  outside  the  targeting  construct  so  that  it  will  only 
detect  either  the  recombinant  event  or  the  germline  fragment  but  not  the  vector  itself.  If  there  is  an  ES  cell 
clone  that  appears  to  be  correct,  we  can  then  verify  this  integration  by  an  Eagl  digest  on  the  DNA 
combined  with  Probe  B  for  the  Southern  blot.  This  will  yield  a  ~2.8  kb  fragment  for  a  homologous 
recombination  event  vs.  a  >16  kb  germline  band.  Through  Genome  Systems  we  obtained  161  ES  cells 
selected  for  G418  resistance.  At  the  time  of  this  report,  none  of  the  ES  cells  showed  proper  integration  of 
the  KO  vector  as  determined  by  Southern  blotting  and  PCR  analyses.  A  second  round  of  transfections  was 
initiated  on  August  15th  through  Incyte  and  ES  cells  will  be  available  on  or  around  September  1,  2000  for 
screening. 

KEY  RSEARCH  ACCOMPLISHMENTS: 

-Reported  the  identification  and  structural  analysis  of  human  RBM8A  and  RBM8B:  two  highly  conserved 
RNA-binding  motif  proteins  that  interact  with  OVCA1 . 

-Determined  that  OVCA1  is  expressed  in  two  forms:  a  48  kDa  and  a  50  kDa. 

-Identified  that  GRP  78  interacts  with  a  C-terminal  bait  of  OYCA1  in  a  yeast  two-hybrid  trap  system. 
-Created  a  knockout  vector  for  the  Ovca  locus  in  mice. 

-Initiated  screening  for  chimeras. 

REPORTABLE  ACCOMPLISHMENTS  (related  to  the  tasks  outlined  in  the  approved  SOW): 

Schultz,  D.C.,  Vanderveer,  L.,  Berman,  D.B.,  Wong,  A.J.,  and  Godwin,  A.K.  Identification  of  two 
candidate  tumor  suppressor  genes  on  chromosome  17pl3.3.  Cancer  Res,  56:1997-2002,  1996. 

Schultz,  D.C.,  A.H.  Prowse,  A.H.,  Godwin,  A.K.  Characterization  of  OVCA1,  a  novel  tumor 
suppressor:  evidence  for  a  role  in  the  development  of  human  epithelial  tumors.  American  Journal  of 
Human  Genetics  61:443,  1997 
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CONCLUSIONS: 

In  order  for  future  therapies  to  be  developed  for  the  fight  against  cancer  it  is  important  to 
understand  the  basic  molecular  mechanisms  that  give  rise  to  a  specific  cancer  type.  The  fundamental 
mechanisms  underlying  the  genetic  basis  of  cancer  are  slowly  being  defined  and  involve  alterations  in  genes 
which  have  been  classified  into  three  general  categories:  (i)  protooncogenes  are  involved  in  growth 
promotion  and  the  defects  leading  to  cancer  are  a  gain  of  function;  (ii)  tumor  suppressor  genes  are  negative 
regulators  of  growth  and  a  loss  of  function  gives  rise  to  cancer;  and  (iii)  DNA  repair  genes  are  involved  in 
maintaining  the  fidelity  of  the  genome  and  altered  function  can  lead  to  increase  rates  of  mutations  in  both 
classes  of  cancer-causing  genes.  Cancer  is  a  multistep  process  that  involves  alterations  in  many  specific 
genes.  The  normal  cell  has  multiple  independent  mechanisms  that  regulate  its  growth  and  differentiation 
and  several  separate  events  are  required  to  override  these  control  mechanisms.  Progress  is  now  being  made 
in  isolating  these  genes  and  the  proteins  they  encode  for,  determining  the  normal  cellular  functions  of  the 
proteins  and  in  investigating  the  mechanisms  of  tumorigenesis. 

Breast  cancer  is  a  very  common  disease,  causing  about  10%  of  deaths  in  women  in  the  Western 
World,  while  ovarian  cancer  is  the  number  one  gynecologic  killer  in  the  United  States  with  over  25,000 
diagnosed  cases  and  14,500  deaths  in  1999.  Molecular  genetic  analysis  of  these  tumors  has  revealed  many 
genetic  aberrations  that  may  represent  important  steps  in  tumor  development.  To  understand  the  genetic 
pathways  underlying  breast  and  ovarian  tumor  development,  it  is  necessary  to  identify  the  genes  affected 
by  these  genetic  aberrations  and  establish  any  correlations  between  disruption  of  their  function  and  tumor 
phenotype. 

Chromosome  17  frequently  shows  loss  of  heterozygosity  (LOH)  in  both  breast  and  ovarian 
carcinomas.  In  addition,  re-introduction  of  chromosome  17  fragments  into  breast  cancer  cell  lines  has  been 
shown  to  suppress  tumorigenicity.  Therefore,  inactivation  of  tumor  suppressor  genes  on  chromosome  1 7 
appears  to  be  a  critical  event  in  the  pathogenesis  of  breast  cancer  as  well  as  other  cancers.  Although  TP  5  3 
at  chromosome  17pl3.1  is  involved  in  the  pathogenesis  of  breast  and  ovarian  cancer,  our  LOH  mapping 
studies  in  breast  and  ovarian  carcinomas  have  defined  a  region  distal  to  TP53,  at  1 7pl  3.3,  thought  to  harbor 
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a  tumor  suppressor  gene  (Godwin  et  al.,  1994,  Schultz  et.  al,  1996).  New  genes,  OVCA1  and  OVCA2,  have 
been  identified  on  chromosome  1 7pl  3.3,  in  this  critical  region  of  allelic  loss  (Schultz  et.  al.,  1 996).  OVCA1 
is  composed  of  12  coding  exons  and  one  non-coding  exon,  while  OVCA2  is  composed  of  two  exons:  a 
unique  exon  1,  and  an  exon  2  which  comprises  part  of  the  3’  untranslated  region  of  OVCA1.  Thus,  the  two 
genes  are  overlapping,  but  their  protein  products  are  completely  distinct. 

Much  of  our  focus  during  the  past  few  years  has  been  on  trying  to  uncover  clues  about  the  function 
of  OVCA1.  We  have  found  that  OVCA1  is  highly  conserved  and  exists  in  at  least  three  forms;  a  48,  a  50, 
and  an  85-kDa  protein.  Evidence  suggests  that  the  85-kDa  form  is  encoded  by  an  alternatively  spliced  form 
of  OVCA1  and  that  the  48-  and  50-kDa  forms  are  the  result  of  posttranslational  modifications.  Subcellular 
fractionation  studies  indicate  that  p50OVCA1  localizes  primarily  to  the  cytoplasm  whereas  p48OVCA]  is 
predominantly  found  in  the  organelle  fraction  with  some  in  the  nuclear  fraction.  Western  blot  analysis 
revealed  that  p50OVCA1  levels  are  reduced  or  are  absent  in  >30%  of  tumors  examined  when  compared  to 
extracts  from  normal  cells  and  tissues,  and  that  p85OVCA1  is  rarely  detected  in  tumors.  Somatic  mutations 
have  been  detected,  but  are  rare  in  OVCA1.  Furthermore,  two  germline  missense  mutations  have  been  found 
in  breast  cancer-prone  women  who  have  tested  negative  for  a  BRCA1  or  a  BRCA2  mutation.  Attempts  to 
create  breast  and  ovarian  cell  lines  that  stably  over-express  the  p50  form  of  OVCA1  have  generally  been 
unsuccessful.  The  clones  that  do  express  exogenous  p50OVCA1  do  so  at  very  low  levels,  and  have 
dramatically  reduced  rates  of  proliferation,  an  increased  proportion  of  the  cells  in  the  Gi  fraction  of  the  cell 
cycle,  and  decreased  levels  of  cyclin  D,  which  may  be  caused  by  an  accelerated  rate  of  cyclin  D  degradation 
(Bruening,  W.,  et  al.,  1999).  Reversion  of  these  cells  to  a  more  rapid  growth  phenotype  is  accompanied  by 
complete  loss  of  expression  of  exogenous  OVCA1.  Screens  for  proteins  that  potentially  interact  with 
OVCA1  have  uncovered  several  known  and  some  unidentified  proteins,  including  a  novel  RNA  binding 
protein  (BOV-1/RBM8A)  (Salicioni,  A.M.,  et  al.,  2000).  The  significance  of  this  interaction  is  being 
evaluated  and  it  is  known  that  in  eukaryotic  cells,  different  classes  of  RNA  are  synthesized  in  the  nucleus, 
including  messenger  RNA  (mRNA),  ribosomal  RNA  (rRNA),  transfer  RNA  (tRNA)  and  small  nuclear 
RNA  (snRNA),  and  must  be  actively  exported  to  the  cytoplasm.  After  the  transcription  of  tRNA  genes, 
the  resulting  RNAs  undergo  numerous  changes  before  a  mature  translation-competent  species  is  produced. 
These  changes  have  been  found  to  include  terminal  processing,  intron  splicing,  editing,  deamination  and 
addition  on  the  nucleotide  level  (see  review  by  (Ibba  and  Soil,  1999)),  many  of  which  are  mediated  by 
RNA-binding  proteins  (RBPs)  and  by  small  RNAs  as  stable  RNP  complexes.  The  most  notable  feature  of 
the  RBPs  is  the  presence  of  one  or  more  copies  of  the  RRM,  the  most  widely  found  and  best-characterized 
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RNA-binding  motif  (Bimey,  et  al.,  1993;  Burd  and  Dreyfuss,  1994a).  We  have  determined  that  RBM8A 
has  only  one  copy  of  this  RRM  with  one  set  of  the  two  consensus  nucleic  acid  binding  motifs,  RNP-1  and 
RNP-2,  which  are  characteristic  of  the  heterogeneous  nuclear  ribonucleoprotein  (hnRNP)  family  of 
proteins. 

The  RRM  is  the  only  RNA-binding  motif  for  which  detailed  structural  information  is  available.  The 
structures  of  several  RRM  proteins  have  been  determined,  and  all  consist  of  the  same  basic  protein  fold. 
These  include  the  NH2-terminal  RBD  of  U1  snRNP  A  (Tang  and  Rosbash,  1996),  the  U1  domain  of  hnRNP 
Al  (Xu,  et  al.,  1997),  the  single  RBD  of  hnRNP  C  (Gorlach,  et  al.,  1994)  ,  the  tandem  RRMs  of  the  Sex- 
lethal  protein  (SXL)  (Crowder,  et  al.,  1999)  and  the  poly(A)-binding  protein  (PABP)  (Deo,  et  al.,  1999).  On 
one  hand,  SXL  governs  sexual  differentiation  and  dosage  compensation  in  Drosophila  melanogaster  by 
binding  specific  RNA  transcripts,  while  it  is  known  that  the  mRNA  poly(A)  tail  is  an  important  contributor 
to  both  translation  initiation  and  mRNA  stabilization/degradation  (Sachs  and  Wahle,  1993).  We  have  built 
models  of  RBM8A  based  on  the  Sxl  and  PABP  structures.  These  models  identify  residues  on  the  surface  of 
the  sheet  that  are  likely  to  bind  single-stranded  RNA,  and  share  identity  in  other  RNA  binding  domains  with 
the  same  protein  fold.  Since  many  of  these  interactions  make  contact  with  RNA  bases,  mutations  of  these 
residues  are  likely  to  alter  the  binding  affinity  of  RBM8A  for  its  natural  RNA  ligand. 

In  contrast  to  the  RRM,  much  less  is  known  about  the  structure  and  function  of  the  RS  and  RG  motifs 
or  other  auxiliary  domains  found  in  the  RBPs.  In  considering  the  possible  essential  function  of  RBM8 
proteins,  sequence  analysis  of  RBM8A  allowed  us  to  predict  the  presence  of  clusters  of  arginine/serine-rich 
and  arginine/glycine-rich  regions,  known  as  SR  and  RG  motifs,  respectively.  Furthermore,  RBM8A  has 
significant  homology  to  the  RG  and  SR  domains  described  for  other  human  members  of  the  RRM 
superfamily,  i.e.  nucleolin,  the  EWS  proto-oncogene  and  splicing  factors,  with  important  roles  in  mRNA 
biogenesis  and  rRNA  synthesis,  and  in  the  Ewing's  sarcoma  translocation  (Burd  and  Dreyfuss,  1994b; 
Delattre,  et  al.,  1992). 

An  increasing  number  of  proteins  appear  to  be  multifunctional,  participating  in  transcriptional  and 
post-transcriptional  events.  The  tumor  suppressor  WT1,  initially  considered  to  be  a  typical  transcription 
factor  (Hastie,  1994),  may  also  be  involved  in  splicing  (Lamond,  1995).  Interestingly,  the  N-terminus  of 
WT1  may  also  contain  a  cryptic  RRM,  discovered  through  molecular  modeling  (Kennedy,  et  al.,  1996). 
Furthermore,  recent  evidence  shows  that  WT1  co-localizes  and  is  physically  associated  with  splice  factors, 
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and  suggests  that  deregulation  of  splicing  may  be  a  crucial  factor  in  the  tumorigenesis  of  the  genitourinary 
system  (Davies,  et  al. ,  1999).  Aberrant  splicing  has  been  linked  to  the  early  and  intermediate  stages  of 
mammary  tumor  formation  (Stickeler,  et  al.,  1999),  and  a  deregulated  expression  balance  between  hnRNPs 
and  SR  factors  has  been  reported  for  human  colon  tumor  formation  and  metastasis  (Ghigna,  et  al.,  1998). 

The  importance  of  our  findings  is  reflected  in  the  recent  discoveries  that  several  human  and  other 
vertebrate  genetic  disorders  (Buckanovich  and  Darnell,  1997;  Dropcho  and  King,  1994;  Nishiyama,  et  al, 
1998)  are  caused  by  aberrant  expression  of  RNA-binding  proteins.  Furthermore,  recent  genetic  linkage 
studies  suggest  that  14q22-23  loci  may  be  implicated  in  the  etiology  of  human  cancers  and  other  diseases 
(Black,  et  al.,  1999;  Knuutila,  et  al.,  1999;  Zech,  et  al.,  1999).  The  conservation  of  RBM8A  sequences 
through  such  an  evolutionarily  divergent  group  of  organisms,  from  yeast  to  mice  and  human,  points  towards 
an  important  function  for  the  RBM8  family  of  proteins.  Our  work  also  provides  a  starting  point  for  further 
structural,  biochemical  and  genetic  studies  of  the  RBM8  protein  family  and  the  biological  relevance  of  its 
interaction  with  OVCA1  in  the  context  of  normal  and  abnormal  cell  growth. 

Our  studies  continue  to  suggest  that  OVCA1  has  certain  properties  that  are  in  common  with  a  number 
of  tumor  suppressors.  We  have  found  that  exogenous  expression  of  OVCA1  can  inhibit  tumor  cell  growth 
and  that  expression  of  the  protein  is  altered  in  both  breast  and  ovarian  tumors.  Yet  through  our  studies,  we 
have  not  been  able  to  establish  a  likely  function  for  OVCA1.  Therefore,  we  have  initiated  studies  in  mice  to 
evaluate  the  effect(s)  of  altering  OVCA1  (and  in  some  cases  OVCA2)  expression  on  normal  growth  and 
development.  By  establishing  such  models,  we  should  be  better  able  to  identify  the  function(s)  of  this  very 
unique,  but  highly  conserved  protein. 
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FIGURES 

Figure  1.  Schematic  of  the  OVCA  gene  locus. 

Figure  2.  Western  blot  analysis  of  OVCA1.  Two  polyclonal  antibodies  to  OVCA1  have  been  derived. 
The  N-terminal  antibodies  (TJ132)  was  directed  against  a  13  amino  acid  peptide  (sequence  highlighted  in 
blue),  while  the  C-terminal  antibodies  (FC22)  was  derived  against  a  fusion  protein  (sequence  highlighted  in 
yellow).  Lower  panel  demonstrates  that  antibodies  are  reactive  and  specific.  A2780,  ovarian  tumor  cell 
line,  HIO-1 17  and  -107,  human  ovarian  surface  epithelial  cell  lines. 

Figure  3.  Western  blot  analysis  of  OVCA1  using  C-  (left  panel)  and  N-terminal  (right  panel)  antibodies. 
Lower  center  panel  demonstrates  that  the  antibodies  detect  different  molecular  weight  species  of  OVCA1 
(50  kDa  or  48  kDa).  A2780,  ovarian  tumor  cell  line,  HIO-1 17  and  -107,  human  ovarian  surface  epithelial 
cell  lines. 

Figure  4.  Western  blot  analysis  of  p50OVCA1  and  p48OVCA1  in  subcellular  fractions.  A2780  cells  were 
separated  into  PI 00  and  SI 00  crude  fractions.  p50OVCA1  is  expressed  primarily  in  the  SI 00  fraction,  while 
p48OVCA1  is  expressed  predominantly  in  the  PI 00  fraction.  WCL,  whole  cell  lysate. 

Figure  5.  Two-dimensional  western  blot  analysis  of  OVCA1  in  293  cells.  Protein  extracts  from  293  cells 
stably  expressing  OVCA1  were  separated  by  2-D  gel  electrophoresis  and  OVCA1  was  detected  using  either 
the  C-terminal  or  N-terminal  antibodies.  Note  the  C-terminal  antibodies  do  not  detect  the  majority  of 
forms  recognized  by  the  N-terminal  antibodies. 

Figure  6.  Strategy  for  targeting  Ovcal.  The  structure  of  the  normal  allele,  targeting  vector 
pScrambler/Ovca,  and  targeted  allele  are  shown.  Blue  boxes  labeled  neo  and  DT-A  indicate  the  neomycin 
resistance  gene  and  diphtheria  toxin  a-subunit  gene  cassettes,  respectively.  Exons  are  shown  as  green  or 
red  filled  boxes,  exl,  ex2,  etc.,  for  Ovcal  or  Ovca2,  respectively,  whereas  other  genomic  segments  are 
shown  in  solid  lines.  Only  relevant  restriction  sites  are  shown:  H =HindIII,  Dv=DraIIL  Ea =EagI 
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The  OVCA1  gene  is  a  candidate  for  the  breast  and 
ovarian  tumor  suppressor  gene  at  chromosome 
17pl3.3.  To  help  determine  the  function(s)  of  OVCA1, 
we  used  a  yeast  two-hybrid  screening  approach  to 
identify  OVCAl-associating  proteins.  One  such  pro¬ 
tein,  which  we  initially  referred  to  as  BOV-1  (hinder  of 
OVCA1-1)  is  173  or  174  amino  acids  in  length  and  ap¬ 
pears  to  be  a  new  member  of  a  highly  conserved  RNA- 
binding  motif  (RBM)  protein  family  that  is  highly  con¬ 
served  evolutionarily.  Northern  blot  analysis  revealed 
that  BOV-1  is  ubiquitously  expressed  and  that  three 
distinct  messenger  RNA  species  are  expressed,  1-,  3.2-, 
and  5.8-kb  transcripts.  The  1-kb  transcript  is  the  most 
abundant  and  is  expressed  at  high  levels  in  the  testis, 
heart,  placenta,  spleen,  thymus,  and  lymphocytes.  Us¬ 
ing  fluorescence  in  situ  hybridization  and  the  5.8-kb 
complementary  DNA  probe,  we  determined  that 
BOV-1  maps  to  both  chromosome  5ql3-ql4  and  chro¬ 
mosome  14q22-q23.  Further  sequence  analysis  deter¬ 
mined  that  the  gene  coding  the  1-  and  the  3.2-kh  tran¬ 
scripts  (HGMW-approved  gene  symbol  RBM8A)  maps 
to  14q22-q23,  whereas  a  second  highly  related  gene 
coding  for  the  5.8-kb  transcript  resides  at  chromosome 
5ql3-ql4  (HGMW-approved  gene  symbol  RBM8B ).  The 
predicted  proteins  encoded  by  RBM8A  and  KBM8B  are 
identical  except  that  RBM8B  is  15  amino  acids  shorter 
at  its  N-terminus.  Molecular  modeling  of  the  RNA- 
binding  domain  of  RBM8A  and  RBM8B,  based  on  ho¬ 
mology  to  the  sex-lethal  protein  of  Drosophila,  identi¬ 
fies  conserved  residues  in  the  RBM8  protein  family 
that  are  likely  to  contact  RNA  in  a  protein-RNA  com¬ 
plex.  The  conservation  of  sequence  and  structure 
through  such  an  evolutionarily  divergent  group  of  or¬ 
ganisms  suggests  an  important  function  for  the  RBM8 
family  of  proteins.  C  2000  Academic  Pre.. 


Sequence  data  from  this  article  have  been  deposited  with  the 
EMBL/GenBank  Data  Libraries  under  Accession  Nos.  AF231511  and 
AF231512. 

1  To  whom  correspondence  should  be  addressed  at  Department  of 
Medical  Oncology,  Room  W304,  Fox  Chase  Cancer  Center,  7701 
Burholme  Avenue,  Philadelphia,  PA  19111.  Telephone:  (215)  728- 
2756.  Fax:  (215)  728-2741.  E-mail:  A  Godwin@fccc.edu. 


INTRODUCTION 

Analyses  of  ovarian,  breast,  and  other  human  tu¬ 
mors  indicate  that  allelic  loss  of  chromosome  17pl3.3  is 
one  of  the  most  frequently  observed  molecular  alter¬ 
ations.  Recent  data  from  our  laboratory  suggest  that 
the  ovarian  cancer  1  gene  ( OVCA1 )2  gene  is  a  strong 
candidate  for  the  breast  and  ovarian  tumor  suppressor 
gene  at  chromosome  17pl3.3,  yet  the  biochemical  func¬ 
tion  is  unknown  (Bruening  et  al.,  1999;  Schultz  et  al., 
1996).  To  help  determine  the  function(s)  of  OVCA1,  we 
used  a  yeast  two-hybrid  screening  approach  to  identify 
OVCAl-associating  proteins.  Several  proteins  were 
identified;  however,  the  most  common  was  a  protein 
containing  an  RNA-binding  motif. 

RNA-binding  proteins  (RBPs)  play  key  roles  in  the 
posttranscriptional  regulation  of  gene  expression  in 
eukaryotic  cells.  Once  produced  in  the  nucleus,  mes¬ 
senger  RNAs  (mRNAs)  are  transported  to  the  cyto¬ 
plasm  where  the  protein  synthesis  machinery  is  lo¬ 
cated,  and  there  is  an  array  of  posttranscriptional 
mechanisms  that  control  mRNA  stability,  localization, 
and  translation  (St  Johnston,  1995).  Many  of  these 
processes  are  mediated  by  RBPs  and  by  small  RNAs  as 
stable  ribonucleoprotein  (RNP)  complexes.  Character¬ 
ization  of  these  proteins  has  led  to  the  identification  of 
several  conserved  RNA-binding  motifs  (Bimey  et  al., 
1993;  Burd  and  Dreyfuss,  1994a).  The  most  notable 
feature  of  many  of  these  proteins  is  the  presence  of  one 
or  more  RNA-recognition  motifs  (RRMs).  The  RRM  is  a 
90-amino-acid  protein  domain  that  binds  single- 
stranded  RNA.  In  contrast  to  the  RRM,  much  less  is 
known  about  the  structure  and  function  of  other  RNA- 
binding  motifs,  including  the  arginine/serine-rich  (RS) 
and  arginine/glycine-rich  (RG)  regions.  We  report  here 
the  initial  characterization  and  structural  analysis  of 
RBM8A  and  a  closely  related  member  of  this  highly 
conserved  RNA-binding  motif  protein  family  referred 

!  Abbreviations  used:  OVCA1,  ovarian  cancer  1  gene;  BOV-1, 
binder  of  OVCA1-1  gene;  RBM,  RNA-binding  motif;  ORF,  open  read¬ 
ing  frame;  cDNA,  complementary  deoxyribonucleic  acid;  UTR,  un¬ 
translated  region;  FISH,  fluorescence  in  situ  hybridization. 
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to  as  RBM8B,  which  interact  with  a  candidate  tumor 
suppressor. 

v 

MATERIALS  AND  METHODS 

Yeast  two-hybrid,  interaction  trap  analysis.  The  two-hybrid 
screening  of  protein  interaction  was  performed  by  standard  protocols 
(Golemis  and  Serebriiskii,  1998)  using  a  human  fetal  brain  yeast 
expression  library  in  the  vector  pJG4-5  kindly  donated  by  Dr.  Erica 
Golemis  (Fox  Chase  Cancer  Center),  pMW103  as  the  LacZ  reporter, 
and  an  N-terminal  or  a  C-terminal  bait,  corresponding  to  amino 
acids  2  to  161  or  225  to  443  of  OVCA1  (i.e.,  LexA-OVCAla.a.2-161  or 
LexA-OVCAla.a.225-443),  respectively. 

Cloning  of  binder  ofOVCAl-1  (BOV-1).  The  entire  coding  regions 
of  BOV-la,  b,  and  c  were  obtained  by  using  an  EcoKl/Xhol  724-bp 
complementary  DNA  (cDNA)  fragment,  to  probe  directly  a  human 
fetal  brain  Uni-ZAP  XR  cDNA  library  (Stratagene,  La  Jolla,  CA)  by 
standard  approaches.  DNA  from  selected  phage  clones  was  se¬ 
quenced. 

Northern  analysis.  The  expression  of  BOV-1  mRNA  in  various 
human  tissues  was  determined  by  hybridization  of  an  ~800-bp 
BOV-1  cDNA  fragment  to  multiple  tissue  Northern  blots,  according 
to  protocols  previously  described  (Schultz  et  al.,  1996). 

Fluorescence  in  situ  hybridization  analysis  of  BOV-1  cDNA.  Flu¬ 
orescence  in  situ  hybridization  (FISH)  and  detection  of  immunoflu¬ 
orescence  were  carried  out  as  previously  described  (Bell  et  al.,  1995). 
A  5.8-kb  cDNA  clone  corresponding  to  BOV-lc  was  biotinylated  by 
standard  nick-translation  methods.  Probes  were  denatured  and  hy¬ 
bridized  to  metaphase  spreads  overnight  at  37°C.  Hybridized  probe 
was  detected  with  fluorescein-labeled  avidin  and  amplified  by  addi¬ 
tion  of  anti-avidin  antibody  (Oncor)  and  a  second  layer  of  fluorescein- 
labeled  avidin.  The  chromosome  preparations  were  counterstained 
with  DAPI  and  observed  with  a  Zeiss  Axiophot  epifluorescence  mi¬ 
croscope  equipped  with  a  cooled  charge-coupled  device  camera  (Pho¬ 
tometries,  Tucson,  AZ)  operated  by  a  Macintosh  computer  worksta¬ 
tion.  Digitized  images  of  DAPI  staining  and  FITC  signals  were 
captured,  pseudocolored,  and  merged  using  Oncor  version  1.6  soft¬ 
ware. 

Mammalian  cell  culture  and  transfection  procedures.  The  Afri¬ 
can  green  monkey  kidney  cell  line  COS-1  (American  Type  Culture 
Collection,  Manassas,  VA)  was  maintained  in  Dulbecco’s  modified 
Eagle’s  medium  (DMEM)  supplemented  with  10%  fetal  calf  serum. 
The  entire  coding  region  of  BOV-lalb  was  PCR-amplified  by  using 
oligonucleotides  containing  both  T7  epitope-tag  coding  sequences 
and  KpnVNotl  overhangs  and  directionally  cloned  into  the  KpnVNotl 
sites  of  the  pCDNA3  expression  vector  (InVitrogen,  Carlsbad,  CA), 
creating  a  full-length  in-frame  T7-BOV-1  fusion  protein.  COS-1  cells 
were  transiently  transfected  with  the  indicated  plasmid  using  Fu- 
Gene  (Roche),  as  directed  by  the  manufacturer,  and  used  for  immu¬ 
nofluorescence  studies. 

Immunofluorescence  microscopy.  COS-1  cells  were  seeded  onto 
coverslips  in  100-mm  dishes  and  grown  to  50%  confluence.  Forty- 
eight  hours  after  transfection,  cells  were  fixed  in  3%  paraformalde¬ 
hyde  and  then  permeabilized  with  2%  Triton  X  in  PBS,  After  incu¬ 
bation  with  monoclonal  antibodies  against  the  T7  tag  (Novagen, 
Madison,  WI),  cells  were  washed  and  then  incubated  with  a  FITC- 
conjugated  anti-mouse  antibody  (Jackson  ImmunoResearch  Labora¬ 
tories  Inc.,  West  Grove,  PA).  Slides  were  visualized  with  a  Nikon 
Eclipse  E800  epifluorescence  microscope,  and  immunofluorescence 
images  were  taken  with  a  Nikon  camera. 

Sequence  comparison  analysis.  Nucleic  acid  and  amino  acid  ho¬ 
mology  searches  were  conducted  by  using  the  NetBLAST  and  FastA 
programs  of  the  GCG  sequence  analysis  package  (Madison,  WI)  by 
searching  the  complete  combined  GenBank/EMBL  and  GenBank 
CDS  translations  +  SwissProt  +  PDB  +  SPupdate  +  PIR  data¬ 
banks,  respectively.  The  expressed  sequence  tag  database  at  NCBI 
was  searched  with  the  BLAST  program. 


Homology  modeling  of  the  RNA  recognition  motif  domain  of  BOV- 
1  /RBM8.  We  used  methods  described  previously  to  build  a  model  of 
the  RBM8  RRM  domain  (Dunbrack,  1999).  Briefly,  PSI-BLAST  was 
used  to  build  a  sequence  profile  of  the  RBM8  family  by  iteratively 
searching  the  nonredundant  protein  sequence  database  available 
from  NCBI  (Altschul  and  Koonin,  1998).  We  used  four  iterations  of 
PSI-BLAST,  and  only  sequences  with  expectation  values  better  than 
0.0001  were  included  in  the  sequence  profile  matrix.  Upon  comple¬ 
tion,  this  matrix  was  used  to  search  a  database  of  protein  sequences 
in  the  Protein  Data  Bank  (Berman  et  al.,  2000)  of  experimentally 
determined  protein  structures.  This  resulted  in  a  list  of  proteins  that 
could  be  used  as  a  template  for  modeling  of  RBM8A/B.  Models  of 
RBM8A/B  were  built  from  two  of  these  structures  using  the  side- 
chain  conformation  prediction  program  SCWRL  (Bower  et  al.,  1997). 
SCWRL  builds  side  chains  on  a  template  backbone  by  first  placing 
residues  according  to  a  backbone-dependent  rotamer  library  (Dun- 
brack  and  Cohen,  1997),  followed  by  a  combinatorial  search  to  re¬ 
move  steric  overlaps. 

RESULTS 

Isolation  of  BOV-1  cDNA 

We  used  a  yeast  two-hybrid  screen  to  identify  cDNAs 
from  a  human  fetal  brain  library  encoding  proteins 
that  were  able  to  interact  with  OVCA1.  A  C-terminal 
bait  of  OVCA1,  corresponding  to  amino  acids  225  to 
443,  yielded  the  only  protein  interactors.  A  total  of 
3.5  X  106  primary  transformants  were  screened,  re¬ 
sulting  in  the  identification  of  28  redundant  clones 
coding  for  4  OVCA1  interactor  candidates.  The  most 
redundant  clone,  initially  referred  to  as  BOV-1  (binder 
of  OVCA1-1),  accounted  alone  for  54%  of  the  total 
cDNA  isolated.  It  represented  15  independent  isolates 
of  a  cDNA  of  700  bp  in  length,  encoding  a  173-amino- 
acid  protein.  The  three  additional  candidate  OVCA1 
binders  were  BIP/GRP78  and  two  previously  unchar¬ 
acterized  proteins  (data  not  included). 

Northern  Analysis  of  BOV-1 

The  expression  pattern  of  BOV-1  mRNA  was  evalu¬ 
ated  by  multiple  tissue  Northern  blotting.  Three  major 
mRNA  species  were  detected,  BOV-la,  lb,  and  lc,  of 
~1,  ~3.2,  and  ~5.8  kb,  respectively.  While  these  spe¬ 
cies  were  expressed  in  all  tissues  to  varying  degrees, 
the  1-kb  transcript  was  most  abundant  in  testis,  heart, 
placenta,  spleen,  thymus,  and  lymphocytes  (Fig.  1). 
The  three  mRNA  species,  BOV-la,  lb,  and  lc,  could 
also  be  detected  in  mammalian  cell  lines  to  varying 
degrees  (data  not  shown). 

Cloning  of  BOV- 1  cDNA 

To  aid  in  the  characterization  of  BOV-1,  we  isolated 
30  cDNA  clones  from  a  human  fetal  brain  library  using 
a  random-primed  700-bp  cDNA  probe  and  sequenced 
14  of  them  to  determine  the  nucleotide  sequence  and 
predicted  amino  acid  sequence  of  BOV-la,  lb,  and  lc. 
Our  results  indicate  that  BOV-la  (the  abundant  ~l-kb 
transcript)  represents  the  entire  coding  region  identi¬ 
fied  through  the  yeast  two-hybrid  screen  and  that 
BOV-lb  results  from  the  use  of  an  alternative  polyad- 
enylation  signal  (Accession  No.  AF231511).  Based  on 
initial  sequence  analysis,  it  appears  that  BOV-lc  may 
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FIG.  1.  Tissue  expression  pattern  of  BOV-1  mRNA.  Blots  containing  5  pg  of  poly  (A)*  selected  mRNA  from  each  of  the  indicated  human 
tissues  were  hybridized  with  an  ~800-bp  BOV-la/b  cDNA  clone.  Size  standards  are  in  kilobases. 


be  the  product  of  alternative  exon  splicing  and  the  use  cDNAs  are  identical  except  for  the  5'-UTRs  (compare 
of  the  alternative  polyadenylation  signal  (Accession  sequences  for  Accession  Nos.  AF231511  and 
No.  AF231512).  AF231512). 

The  cDNA  encoding  BOV-la  consists  of  700  bp  in-  The  predicted  protein  encoded  by  BOV-lc  differs 
eluding  20  bp  of  5 '-untranslated  region  (UTR),  an  AUG  from  BOV-la/b  in  that  the  protein  is  predicted  to  be  15 
leading  into  an  open  reading  frame  (ORF)  of  173  amino  amino  acids  shorter  (see  Fig.  2;  translation  of  protein  re 
acids,  and  a  3'-UTR  of  161  bp.  The  cDNA  for  BOV-lb  encoded  by  BOV-lc  is  predicted  to  start  at  the  second 
differs  from  BOV-la  in  that  the  3'-UTR  is  substantially  methionine,  but  includes  an  additional  amino  acid  at 
longer,  2236  bp  versus  161  bp,  respectively.  The  cDNA  codon  27);  otherwise  the  proteins  are  identical.  The 
encoding  BOV-lc  consists  of  5786  bp  including  3074  bp  predicted  molecular  masses  for  the  BOV-la/b  and 
of  5'-UTR,  an  ORF  of  158  amino  acids,  and  a  3'-UTR  of  BOV-lc  proteins  are  20  and  18  kDa,  and  their  isoelec- 
2238  bp.  At  the  nucleotide  level,  BOV-lb  and  BOV-lc  trie  points  occur  at  pH  5.78  and  pH  7.62,  respectively. 
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FIG.  2.  RBM8A  structural  (A  and  B).  (A)  The  predicted  primary  sequence  of  the  protein  encoded  by  BOV-la/b  cDNA  is  shown,  and  the 
RNA-recognition  motif  (RRM)  is  underlined.  The  arrow  indicates  the  methionine  where  protein  encoded  by  BOV-lc  is  predicted  to  start.  (B) 
A  schematic  diagram  highlighting  structural  domains  of  BOV-la/b  is  presented  in  the  lower  panel.  The  RRM  and  both  RNP1  (residues 
113-120)  and  RNP2  (residues  74-79)  consensus  sequences  are  indicated.  A  putative  bipartite  nuclear  localization  signal  (NLS)  is  predicted 
to  be  present  at  the  N-terminus  of  BOV-la/b.  The  diagram  also  shows  the  arginine/glycine-rich  (RG)  box  (residues  151-162)  and  a 
serine/arginine-rich  (SR)  domain  (residues  163-173)  identified  in  the  C-terminus  of  BOV-la/b.  The  protein  encoded  by  BOV-lc  is  predicted 
to  be  16  amino  acids  shorter  than  BOV-la/b.  The  arrow  indicates  the  methionine  where  protein  encoded  by  RBM8B  is  predicted  to  start.  (C 
and  D)  Immunolocalization  of  BOV-la/b  in  COS-1  cells.  COS-1  cells  were  transiently  transfected  with  pcDN A3/T7 -B OV - 1 .  Forty-eight  hours 
after  transfection,  cells  were  fixed  and  stained  with  a  mouse  monoclonal  anti-T7  tag  antibody  (Novagen),  followed  by  staining  with 
FITC-conjugated  anti-mouse  antibody  (Jackson  Immunochemicals)  (C)  and  counterstaining  with  DAPI  (Sigma)  (D). 
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Comparison  of  12  of  the  14  BOV-1  cDNA  clones  iden¬ 
tified  a  sequence  variant  involving  codon  43  (GAA)  of 
BOV-la/b  (or  codon  27  of  BOV-lc).  In  66.7%  (8/12)  of 
the  clones,  this  additional  codon  was  present.  These 
transcripts  would  be  predicted  to  encode  a  protein  of 
174  amino  acids  (BOV-la/b).  In  comparison,  this  poly¬ 
morphism  was  not  detected  in  any  of  the  BOV-lc  cDNA 
clones. 

Chromosomal  Mapping  of  BOV-1  cDNA 

We  mapped  the  chromosomal  location  of  BOV-1  by 
FISH  using  the  5.8-kb  cDNA  probe,  corresponding  to 
the  BOV-lc  transcript.  Of  the  51  signals  observed,  24 
(47%)  hybridized  specifically  at  5ql3-ql4  in  19  of  the 
20  metaphase  spreads  scored.  In  11  of  20  (55%)  meta¬ 
phase  spreads,  signals  were  also  detected  on  chromo¬ 
some  14,  specifically  at  14q22-q23.  Sixteen  (31%)  of 
the  51  signals  observed  mapped  to  14q22-q23,  indicat¬ 
ing  that  two  closely  related  genes  may  exist  at  these 
jf3  two  sites  (Fig.  3). 

Nucleotide  Sequence  Analysis  of  BOV- 1  cDNAs 

Comparison  analysis  of  BOV-la/b  using  the 
BLASTN  program  demonstrated  99%  nucleotide  ho¬ 
mology  (score  =  898;  E  value  0.0)  to  a  human  EST  from 
a  colon  carcinoma  (HCC)  cell  line  cDNA  library  (Acces¬ 
sion  No.  AA30779).  Comparison  of  the  5'-UTR  of  the 
BOV-lc  cDNA  with  the  GenBank  databases  demon¬ 
strated  99.4%  nucleotide  homology  over  314  nt  (in  the 
reverse  orientation)  to  human  integrin-binding  protein 
Del-1  (Dell)  mRNA  (1712-1399  nt  of  Dell  and  2668- 
2981  nt  of  BOV-lc)  (Accession  Nos.  U70312  and 
AF231512).  The  3'-UTR  for  both  BOV-lb  and  BOV-lc 
also  shared  nucleotide  identity  with  a  human  cDNA 
(Accession  No.  AL049219)  (score  =  496;  E  =  1.0  X 
10  ~m)  identified  in  fetal  brain. 

Based  on  comparison  of  the  cDNA  sequence  for  BOV- 


la/b  and  available  genomic  sequence  (Accession  No. 
AF231511),  the  BOV-la/b  gene  appears  to  lack  introns, 
whereas  BOV-lc  contains  at  least  one  intron  (Acces¬ 
sion  No.  AF231512;  and  data  not  shown).  The  genomic 
sequencing  matching  the  BOV-la/b  cDNA  sequence 
was  placed  on  chromosome  14,  further  indicating  that 
BOV-la/b  (Accession  No.  AF231511)  and  BOV-lc  (Ac¬ 
cession  No.  AF231512)  are  distinct  genes  that  reside  on 
different  chromosomes.  Based  on  our  mapping  and  se¬ 
quence  data,  approved  names  for  BOV-la/b  and 
BOV-lc  have  recently  been  assigned  by  the  Human 
Gene  Nomenclature  Committee  (http://www.gene. 
ucl.ac.uk/nomenclature)  and  are  RBM8A  and  RBM8B, 
respectively. 

Protein  Sequence  Analysis  and  Subcellular 

Localization  of  BOV-1/RBM8 

The  deduced  primary  amino  acid  sequence  of 
RBM8A  and  RBM8B  indicates  the  presence  of  one  copy 
of  an  RNA-binding  domain  (RBD)  in  the  central  region 
(amino  acid  residues  71-148  or  55-132,  respectively), 
also  known  as  an  RRM  (Figs.  2A  and  2B).  This  RRM 
contains  one  set  of  the  two  consensus  nucleic  acid¬ 
binding  motifs,  RNP-1  (aa  113-120  for  RBM8A  and  aa 
97-104  for  RBM8B)  and  RNP-2  (aa  74-79  for  RBM8A 
and  aa  58-63  for  RBM8B),  which  are  characteristic  of 
the  heterogeneous  nuclear  ribonucleoprotein  (hnRNP) 
family  of  proteins.  The  RBM8A  and  RBM8B  amino 
acid  sequence  also  contains  a  putative  bipartite  nu¬ 
clear  localization  signal  (Robbins  et  al.,  1991)  at  the 
N-terminus  (aa  33-51  for  RBM8A  and  aa  17-35  for 
RBM8B)  and  a  stretch  rich  in  glycine  residues  (not 
shown).  Interestingly,  the  C-terminus  of  the  RBM8A 
and  RBM8B  proteins  (residues  151-173  and  135-157, 
respectively)  shows  significant  homology  to  the  serine/ 
arginine-rich  (SR)  domain  of  the  splicing  factor  SC35 
(Fu  and  Maniatis,  1992),  as  well  as  a  domain  rich  in 
glycine  and  arginine  (residues  151-162  for  RBM8A  and 


FIG.  3.  Chromosomal  localization  of  BOV-la/b  and  BOV-lc  by  FISH.  Partial  metaphase  spread  showing  specific  hybridization  signals  at 
chromosome  5ql3-ql4  (arrowheads).  Arrow  indicates  specific  hybridization  signals  at  chromosome  14q22-q23.  (Left  inset)  BOV-Ic-specific 
hybridization  at  5ql3-ql4  to  individual  chromosomes  from  other  metaphases.  (Right  inset)  BOV-lc- specific  hybridization  at  14q22-q23  to 
individual  chromosomes  from  other  metaphases. 


HUMAN  RBM8A  AND  RBM8B,  NOVEL  RNA-BINDING  MOTIF  PROTEINS 


residues  135-146  for  RBM8B),  reminiscent  of  the  RG 
box  described  in  human  nucleolin  (Nigg,  1997).  In  ad¬ 
dition,  we  analyzed  the  intracellular  distribution  of 
RBM8A  by  immunofluorescence  analysis.  Immunolo- 
calization  experiments  using  COS-1  cells  transfected 
with  a  T7-tagged  RBM8A  expression  vector  indicate 
that  full-length  RBM8A  is  localized  mostly  in  the  cell 
nucleus  and  is  finely  diffused  throughout  the  cyto¬ 
plasm  (Figs.  2C  and  2D). 

RBM8  proteins  appear  to  be  highly  conserved  evolu- 
tionarily.  Sequence  analysis  of  the  predicted  RBM8A 
and  RBM8B  coding  sequence  using  both  FastA  and 
TBLASTN  algorithms  revealed  that  Mus  musculus, 
zebrafish,  Xenopus  laevis,  Drosophila  melanogaster, 
Caenorhabditis  elegans,  and  Schizosaccharomyces 
pombe  encode  hypothetical  proteins  remarkably  simi¬ 
lar  to  RBM8A  and  RBM8B  at  the  amino  acid  level  (Fig. 

F4  4;  and  data  not  shown).  The  percentages  of  amino  acid 
identity  and  similarity  relative  to  RBM8A  are  indi¬ 
cated  in  the  legend  to  Fig.  4.  M.  musculus  hypothetical 
protein  (AL0022712)  is  11  amino  acids  shorter  than 
RBM8A  and  100%  identical  to  RBM8A,  while  zebrafish 
and  X.  laevis,  for  example,  encode  hypothetical  pro¬ 
teins  with  93.7  and  91.2%  identity  to  RBM8A,  respec¬ 
tively.  The  sequence  similarity  is  spread  throughout 
the  entire  amino  acid  sequence  and  is  most  pronounced 
in  the  RRM  region,  extending  from  amino  acid  residues 
70  to  150  of  the  RBM8A  sequence. 

RBM8  Protein  Model 

To  gain  more  information  on  the  RNA-binding  prop¬ 
erties  of  RBM8A,  we  built  a  three-dimensional  model  of 
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the  RRM  domain  of  RBM8A.  Sequence  analysis  using 
PSI-BLAST  (see  Materials  and  Methods)  indicated 
that  there  were  18  possible  template  structures  in  six 
families  in  the  Protein  Data  Bank  that  could  be  used  to 
build  the  RRM  domain  of  RBM8.  We  chose  two  of  these, 
the  sex-lethal  protein  from  Drosophila  (Sxl,  PDB  entry 
lb7f)  (Handa  et  al.,  1999)  and  the  poly(A)-binding  pro¬ 
tein  (PABP,  PDB  entry  lcvj)  (Deo  et  al.,  1999),  since 
both  of  these  sequences  could  be  aligned  with  RBM8A 
without  insertions  or  deletions.  The  resulting  sequence 
alignment  is  shown  in  Fig.  5A.  In  addition,  both  struc-  fs 
tures  contained  RNA  so  that  interations  between 
RBM8A  and  RNA  could  also  be  modeled. 

We  built  models  of  RBM8A  from  both  of  these  struc¬ 
tures  using  the  side-chain  conformation  prediction  pro¬ 
gram  SCWRL  (Bower  et  al.,  1997).  A  superposition  of 
the  backbones  of  these  two  models  is  shown  in  Fig.  5B. 
The  root-mean-square  (RMS)  deviation  of  the  back¬ 
bones  is  0.82  A.  The  sequence  identity  between  Sxl  and 
PABP  is  31%,  while  the  sequence  identity  between 
RBM8  and  Sxl  is  24%  and  that  between  RBM8A  and 
PABP  is  27%.  Since  these  sequence  identities  are  sim¬ 
ilar,  we  expect  that  the  model  of  the  RRM  of  RBM8A  is 
quite  accurate,  with  an  RMS  comparable  to  the  Sxl- 
PABP  RMS  of  0.82  A. 

In  Fig.  5C,  we  show  the  model  of  RBM8A  based  on 
Sxl.  Residues  that  bind  to  RNA  are  indicated  in  Fig. 
5C.  These  include  F75,  T77,  H102,  R107,  Fill,  Y115, 
L117,  and  P142.  Some  of  these  residues  are  identical  or 
similar  in  Sxl  and/or  PABP,  which  may  indicate  simi¬ 
lar  interactions  between  these  side  chains  and  RNA. 
For  example,  RBM8A  F75  is  a  tyrosine  in  PABP  and 
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FIG.  4.  RBM8A  protein  sequence  alignment  with  hypothetical  proteins  identified  in  M.  musculus  (100%  identity),  zebrafish  (93.7%/ 
98.7%),  X.  laevis  (91.2%/95.6%),  D.  melanogaster  (63%/76%),  C.  elegans  (60.5%/75.3%),  and  S.  pombe  (48.8%/64.5%),  using  the  UWGCG 
PileUp  program.  Percentages  of  identity/similarity  relative  to  RBM8A  are  indicated  in  parentheses.  GenBank  Accession  Nos.  AL0022712, 
AI943400,  AW200013,  AC006074,  CAA83626,  and  AL021813,  respectively.  White  letters  in  black  boxes  indicate  identical  residues.  Shading 
indicates  conserved  residues. 
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FIG.  6.  Homology  modeling  of  the  RNA  recognition  motif  domain  of  RBM8  A.  (A)  Sequence  alignment  of  RBM8A  RNA-binding  domain 
with  PABP  and  Sxl  from  PDB  entries  1CVJ  and  1B7F,  respectively.  (B)  Superposition  of  models  of  BOV-1/RBM8A  based  on  the  structures 
of  Sxl  (protein  in  purple  and  RNA  in  green)  and  PABP  (protein  in  yellow  and  RNA  in  red).  The  root-mean-square  deviation  of  the  protein 
backbone  Ca  coordinates  is  0.82  A.  (C)  Model  of  RBM8A  based  on  Sxl  in  a  complex  with  single-stranded  RNA  with  sequence  GUU- 
GUUUUUUUU.  Residues  that  contact  the  RNA  in  either  the  Sxl  or  the  PABP  models  are  indicated. 

forms  an  aromatic  ring-stacking  interaction  with  an  PABP  and  a  hydrogen  bond  to  a  uracil  ring  carbonyl  in 
adenine  in  the  RNA.  In  Sxl  this  residue  is  an  isoleu-  Sxl.  RBM8A  Ylll  is  also  a  tyrosine  in  Sxl,  which 
cine,  making  a  hydrophobic  interaction  with  a  uracil  makes  a  hydrogen  bond  with  a  uracil  carbonyl  in  Sxl. 
base.  RBM8AY115  is  a  tyrosine  in  both  Sxl  and  PABP,  While  it  is  not  possible  to  predict  the  RNA-binding 
making  a  ring-stacking  interaction  with  an  adenine  in  sequence  for  RBM8A  from  the  model,  it  is  clear  that 


i 


HUMAN  RBM8A  AND  RBM8B,  NOVEL  RNA-BINDING  MOTIF  PROTEINS 


7 


many  of  the  residues  typical  of  RNA-protein  interac¬ 
tions  in  this  family  of  proteins  are  contained  in  the 
RNA-binding  site  of  RBM8A.  Since  many  of  these  in¬ 
teractions  make  contact  with  RNA  bases,  mutations  of 
these  residues  are  likely  to  alter  the  binding  affinity  of 
RBM8A  for  its  natural  RNA  ligand. 

DISCUSSION 

Understanding  the  biological  function  of  the  ovarian 
tumor  suppressor  OVCA1  may  provide  new  insights 
into  how  breast  and  ovarian  cancers  develop.  To  dis¬ 
cover  important  clues  into  the  function  of  OVCA1,  we 
used  a  yeast  two-hybrid  interaction  trap  method  to 
identify  proteins  that  interact  with  OVCA1.  We  iden¬ 
tified  several  candidates  (Salicioni  et  al.,  unpublished 
results),  including  a  novel  protein  that  contains  a 
highly  conserved  RNA-binding  motif,  initially  referred 
to  as  BOV-1.  In  this  paper,  we  demonstrate  that  BOV- 
la/b  (renamed  RBM8A )  maps  to  chromosome  14q22- 
q23  and  that  the  closely  related  novel  gene  BOV-lc 
(renamed  RBM8B)  maps  to  chromosome  5ql3-ql4. 
Furthermore,  the  protein  encoded  by  RBM8B  is  iden¬ 
tical  to  RBM8A  except  that  it  is  16  amino  acids  shorter 
at  the  N-terminus  and  thus  appears  to  be  a  new  mem¬ 
ber  of  a  highly  conserved  RBM  protein  family.  Since 
both  RNA  transcripts  are  ubiquitously  expressed  in 
human  tissues  and  are  predicted  to  encode  a  protein,  it 
is  unlikely  that  either  gene  that  we  identified  is  a 
pseudogene.  Two  sequences  related  to  RBM8A,  (Acces¬ 
sion  Nos.  AF127761  and  AF161463)  were  released  in 
the  GenBank  database  while  this  article  was  in  prep¬ 
aration.  Our  results  are  further  supported  by  the 
knowledge  that  genes  coding  for  RNA-binding  proteins 
have  diversified  by  duplication  of  genes  and  intragenic 
domains  (Bandziulis  et  al.,  1989;  Bimey  et  al.,  1993).  A 
three-dimensional  model  of  the  RRM  domain  of 
RBM8A/B  indicates  that  these  RBM8  sequences  will 
fold  properly  into  an  RNA-binding  domain,  forming  a 
hydrophobic  core  between  a  /3-sheet  and  two  a-helices. 

In  eukaryotic  cells,  different  classes  of  RNA  are  syn¬ 
thesized  in  the  nucleus,  including  mRNA,  ribosomal 
RNA,  transfer  RNA  (tRNA),  and  small  nuclear  RNA, 
and  must  be  actively  exported  to  the  cytoplasm.  After 
the  transcription  of  tRNA  genes,  the  resulting  RNAs 
undergo  numerous  changes  before  a  mature  transla¬ 
tion-competent  species  is  produced.  These  changes 
have  been  found  to  include  terminal  processing,  intron 
splicing,  editing,  deamination,  and  addition  on  the  nu¬ 
cleotide  level  (see  review  by  Ibba  and  Soil  (1999)), 
many  of  which  are  mediated  by  RBPs  and  by  small 
RNAs  as  stable  RNP  complexes.  The  most  notable  fea¬ 
ture  of  the  RBPs  is  the  presence  of  one  or  more  copies 
of  the  RRM,  the  most  widely  found  and  best-character¬ 
ized  RNA-binding  motif  (Bimey  et  al.,  1993;  Burd  and 
Dreyfuss,  1994a).  We  have  determined  that  both 
RBM8A  and  RBM8B  have  only  one  copy  of  this  RRM 
with  one  set  of  the  two  consensus  nucleic  acid-binding 
motifs,  RNP-1  and  RNP-2,  which  are  characteristic  of 
the  hnRNP  family  of  proteins. 


The  RRM  is  the  only  RNA-binding  motif  for  which 
detailed  structural  information  is  available.  The  struc¬ 
tures  of  several  RRM  proteins  have  been  determined, 
and  all  consist  of  the  same  basic  protein  fold.  These 
include  the  NH2-terminal  RBD  of  U1  snRNP  A  (Tang 
and  Rosbash,  1996),  the  U1  domain  of  hnRNP  Al  (Xu 
et  al.,  1997),  the  single  RBD  of  hnRNP  C  (Gorlach  et  al., 
1994),  the  tandem  RRMs  of  the  sex-lethal  protein 
(SXL)  (Crowder  et  al.,  1999),  and  the  poly(A)-binding 
protein  (PABP)  (Deo  et  al.,  1999).  On  one  hand,  SXL 
governs  sexual  differentiation  and  dosage  compensa¬ 
tion  in  D.  melanogaster  by  binding  specific  RNA  tran¬ 
scripts,  while  it  is  known  that  the  mRNA  polyCA)  tail  is 
an  important  contributor  to  both  translation  initiation 
and  mRNA  stabilization/degradation  (Sachs  and 
Wahle,  1993).  We  have  built  models  of  RBM8A  based 
on  the  Sxl  and  PABP  stmctures.  These  models  identify 
residues  on  the  surface  of  the  sheet  that  are  likely  to 
bind  single-stranded  RNA  and  share  identity  in  other 
RNA-binding  domains  with  the  same  protein  fold. 
Since  many  of  these  interactions  make  contact  with 
RNA  bases,  mutations  of  these  residues  are  likely  to 
alter  the  binding  affinity  of  RBM8A  and  RBM8B  for  its 
natural  RNA  ligand. 

In  contrast  to  the  RRM,  much  less  is  known  about 
the  structure  and  function  of  the  RS  and  RG  motifs  or 
other  auxiliary  domains  found  in  the  RBPs.  In  consid¬ 
ering  the  possible  essential  function  of  RBM8  proteins, 
sequence  analysis  of  RBM8A  and  RBM8B  allowed  us  to 
predict  the  presence  of  clusters  of  arginine/serine-rich 
and  arginine/glycine-rich  regions.  Furthermore, 
RBM8A  and  RBM8B  have  significant  homology  to  the 
RG  and  SR  domains  described  for  other  human  mem¬ 
bers  of  the  RRM  superfamily,  i.e.,  nucleolin,  the  EWS 
proto-oncogene,  and  splicing  factors,  with  important 
roles  in  mRNA  biogenesis  and  rRNA  synthesis  and  in 
the  Ewing  sarcoma  translocation  (Burd  and  Dreyfuss, 
1994b;  Delattre  et  al.,  1992). 

An  increasing  number  of  proteins  appear  to  be  mul¬ 
tifunctional,  participating  in  transcriptional  and  post- 
transcriptional  events.  The  tumor  suppressor  WT1, 
initially  considered  to  be  a  typical  transcription  factor 
(Hastie,  1994),  may  also  be  involved  in  splicing  (Lam- 
ond,  1995).  Interestingly,  the  N-terminus  of  WT1  may 
also  contain  a  cryptic  RRM,  discovered  through  molec¬ 
ular  modeling  (Kennedy  et  al.,  1996).  Furthermore, 
recent  evidence  shows  that  WT1  colocalizes  and  is 
physically  associated  with  splice  factors  and  suggests 
that  deregulation  of  splicing  may  be  a  crucial  factor  in 
the  tumorigenesis  of  the  genitourinary  system  (Davies 
et  al.,  1999).  Aberrant  splicing  has  been  linked  to  the 
early  and  intermediate  stages  of  mammary  tumor  for¬ 
mation  (Stickeler  et  al.,  1999),  and  a  deregulated  ex¬ 
pression  balance  between  hnRNPs  and  SR  factors  has 
been  reported  for  human  colon  tumor  formation  and 
metastasis  (Ghigna  et  al.,  1998). 

The  importance  of  our  findings  is  reflected  in  the 
recent  discoveries  that  several  human  and  other  ver¬ 
tebrate  genetic  disorders  (Buckanovich  and  Darnell, 
1997;  Dropcho  and  King,  1994;  Nishiyama  et  al.,  1998) 
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are  caused  by  aberrant  expression  of  RNA-binding  pro¬ 
teins.  Furthermore,  recent  genetic  linkage  studies  sug¬ 
gest  that  both  5ql3-ql4  and  14q22-q23  loci  may  be 
implicated  in  the  etiology  of  human  cancers  and  other 
diseases  (Black  et  al.,  1999;  Knuutila  et  ah,  1999;  Zech 
et  al.,  1999).  The  conservation  of  RBM8A  and  RBM8B 
sequences  through  such  an  evolutionarily  divergent 
group  of  organisms,  from  yeast  to  mice  and  human, 
points  toward  an  important  function  for  the  RBM8 
family  of  proteins.  Our  work  also  provides  a  starting 
point  for  further  structural,  biochemical,  and  genetic 
studies  of  the  RBM8  protein  family  and  the  biological 
relevance  of  its  interaction  with  OVCA1  in  the  context 
of  normal  and  abnormal  cell  growth. 
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